Neuroimaging Data Releases

The neuroimaging data releases contain the MRI data, physiological data obtained during scanning (cardiac and respiratory) and only basic phenotypic information (age, sex, handedness). The following scans are included for subjects in the Cross-Sectional Lifespan Connectomics Study, Longitudinal Developmental Connectomics Study, and Mapping Interindividual Variation In The Aging Connectome studies:

  • Anatomical (MPRAGE)
  • Diffusion Tensor Imaging (137 Directions)
  • Resting Functional (TR = 645ms, Multiband)
  • Resting Functional (TR = 1400ms, Multiband)
  • Resting Functional (TR = 2500ms)
  • Visual Checkerboard (TR = 645ms, Multiband)
  • Visual Checkerboard (TR = 1400ms, Multiband)
  • Breath Hold (TR = 1400ms, Multiband)

The Real-Time Neurofeedback study includes these scans:

  • Anatomical (MPRAGE)
  • Default Mode Network Training Scan
  • Default Mode Network Testing Scan
  • Moral Dilemma
  • MSIT
  • PEER1
  • PEER2

Downloading Neuroimaging Data Releases from NITRC

Anonymized image data is available in either DICOM or NIfTI file formats on NITRC. Each scan folder also includes a JSON file containing information on scan parameters and file provenance (any changes made to the image since it came out of the scanner).

In order to access NKI-RS neuroimaging data via NITRC you must be:

  1. Registered for NITRC. You may register here if you are not registered already.
  2. Logged into NITRC.
  3. Registered with the 1000 Functional Connectomes Project.

Once these conditions are satisified, you may use the links below to download data. If you are not logged in and properly registered a permissions error will occur.

Downloading Neuroimaging Data Release Data from the FCP-INDI S3 Bucket

Data for the Rockland Sample Neuroimaging Data Release are available for download in an Amazon Web Services S3 bucket. These data are currently organized as follows:

  • RawData : Raw data for Rockland sample releases converted to NifTI format and not separated by release. The directory tree is organized with participants at the root, visits below participants, and individual series below visits.
  • RawDataTars : Raw data compressed with Gzip and separated by release.
  • PhenotypicData : Phenotypic data for participants (i.e., age, sex, handedness) stored in csv files by release.
  • RawDataBIDS : Raw data organized in the BIDS format. Since BIDS makes provisions for phenotypic and data collected during scanning (physiological,event-related), this data is also included in this folder in addition to the MRI series NifTIs. DICOMs are not included. Currently only release 9 is available in BIDS.
  • Outputs : Rockland Sample data processed using C-PAC and the Gibbs’ connectome pipeline.

Each file in the S3 bucket can only be accessed using HTTP (i.e., no ftp or scp ). You can obtain a URL for each desired file and then download it using an HTTP client such as a web browser, wget, or curl. Each file can only be accessed using its literal name- wildcards will not work. We have written a script (described below) that can perform custom dataset downloads for you via HTTP.

There are file transfer programs that can handle S3 natively and will allow you to navigate through the data using a file browser. Cyberduck is one such program that works with Windows and Mac OS X. Cyberduck also has a command line version that works with Windows, Mac OS X, and Linux. Instructions for using Cyberduck are as follows:

  1. Open Cyberduck and click on Open Connection.
  2. Set the application protocol in the dropdown menu to S3 (Amazon Simple Storage Service).
  3. Set the server to s3.amazonaws.com.
  4. Check the box labelled Anonymous Login.
  5. Expand the More Options tab and set Path to fcp-indi/data/Projects/RocklandSample.
  6. Click Connect.

The end result should appear as follows:

_images/rs_cyberduck.png

Using the Downloader Script for S3

The Python script located here allows you to specify a number of options so to download a customized subset of the sample from the S3 bucket. In particular, it allows you to specify:

  • Age Ranges
  • Sex
  • Handedness
  • Study and Visit
  • Scan Type
  • Series
  • Whether or not to download some derivatives of the raw data (such as despiked physiological data collected alongside series or a mask used in the real-time neurofeedback study).

If none of these items are specified, the script will assume that you would like to download all available raw NKI-RS Neuroimaging Data release data without derivatives.

The script requires the following dependencies to function properly:

  • Python 2
  • pandas

To install pandas within Python 2, you can use the following command: pip install pandas

Note: All examples in this section assume some literacy with the Unix command line and that the commands are being executed from the local directory where the script is stored. If you have not used the command line before, a number of tutorials to get you started are linked to here.

Study / Visit Codes

If you want to specify which studies / visits to download, you must take a study code and append the desired visit code to it, and then add this after the -v option. You may specify multiple study/visit code combinations after this option.

  • Real-Time Neurofeedback
  • Study Code: NFB
  • Visit Codes:
  • 2 - The second visit (original 2-day scheduling format).
  • 2R - A re-visit to complete scans for the second visit (original 2-day scheduling format).
  • A - The abridged second visit (abridged scheduling format).
  • AR - A re-visit to complete scans for the abridged second visit (abridged scheduling format).
  • 3 - The third visit.
  • Cross-Sectional Lifespan Connectomics
  • Study Code: DS
  • Visit Codes:
  • 2 - The second visit (original 2-day scheduling format).
  • A - The abridged second visit (abridged scheduling format).
  • Longitudinal Developmental Connectomics
  • Study Code: CLG
  • Visit Codes:
  • 2 - The second visit (original 2-day scheduling format).
  • A - The abridged second visit (abridged scheduling format).
  • 2R - The second visit (original scheduling format) with a re-test.
  • AR - The abridged second visit (abridged scheduling format) with a re-test.
  • 3 - The third visit.
  • 4 - The fourth visit.
  • 4R - The fourth visit with a re-test.
  • 5 - The fifth visit.

Note that in the real-time neurofeedback study, ‘R’ signifies a re-visit, while in the longitudinal developmental connectomics study, it represents a re-test.

For example:

python download_rockland_raw_bids.py -o /data/output -v CLG4 CLG4R CLG5

This will download all of the files for the cross-sectional lifespan connectomics study for the 4th and 5th visits and store them in /data/output in BIDS format.

Scan Types

Multiple scan types, separated by spaces after the scan type parameter, can be selected. These scan types can take on the following values, which come from the BIDS specification:

  • anat - Anatomical scans.
  • func - Functional/resting-state scans.
  • dwi - DTI scans.
  • fmap - Field map scans (NFB3 only)

For example:

python download_rockland_raw_bids.py -o /data/output -t anat dwi

This will download all of the anatomical and dti data only across all visits.

Series Codes

Similarly, multiple series can be specified at the command line. These series can take on the following values:

  • Non-neurofeedback studies:
  • REST645
  • REST1400
  • RESTCAP
  • RESTPCASL
  • CHECKERBOARD645
  • BREATHHOLD1400
  • CHECKERBOARD1400
  • Neurofeedback:
  • DMNTRACKINGTRAIN
  • DMNTRACKINGTEST
  • MASK
  • MORALDILEMMA
  • MSIT
  • PEER1
  • PEER2

For example:

python download_rockland_raw_bids.py -o /data/output -e REST645 DMNTRACKINGTRAIN

This will download only the data for these two series across all participants and visits.

Derivatives

Toggling the derivatives flag will download the mask used for the real-time neurofeedback study and despiked physiological data in addition to any other data that fit the other given criteria.

For example, the following command will download all Rockland sample data plus the aforementioned derivatives:

python download_rockland_raw_bids.py -o /data/output -d

An Elaborate Example

Here is a more complex example of the downloader script’s syntax, which illustrates how you can combine command line options to generate extremely nuanced subsets of the Rockland sample.

python download_rockland_raw_bids.py -o /data/output -e REST645 -v DSA DS2 NFBAR NFB2R -t anat func -gt 15 -lt 40 -m L -x F

This command would download all anatomical series and all resting state series with a 645 ms TR for left-handed women between the ages of 15 and 40, but only from the abridged and second visits of the discovery science and neurofeedback studies. This data could be used to assess within-subject variations in connectivity and neuroanatomy for this specific population, since the first visit for the discovery science and neurofeedback protocols are identical.

To get more information on all of the parameters you may use, type:

python download_rockland_raw_bids.py -h